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Abstract 

We present an analysis of the effects of global topology on the structural stability of folded 
proteins in thermal equilibrium with a heat bath. For a large class of single domain proteins, we 
computed the harmonic spectrum within the Gaussian Network Model (GNM) and determined the 
spectral dimension, a parameter describing the low frequency behaviour of the density of modes. 
We find a surprisingly strong correlation between the spectral dimension and the number of amino 
acids of the protein. Considering that larger spectral dimension value relate to more topologically 
compact folded state, our results indicate that for a given temperature and length of the protein, the 
folded structure corresponds to the less compact folding compatible with thermodynamic stability. 
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INTRODUCTION 



The role of geometry has recently been considered as a factor of primary importance for 
the study of several physical properties of proteins and other biological macromolecules. In 



particular, since the topolo^ 
of the protein 



of folded states is known to influence the folding properties 
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a great deal of work has been devoted to the study 



of those theoretica 



folded proteins. 
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aspects which describe the networks of links between amino acids in 



10 



111 ]. Furthermore, relevant features of protein conformations seem to 
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follow the geometrical principles of the optimal packing problem \VA 
concepts from graph theory have been interestingly applied to identify flexible and rigid 
regions of folded states 

Starting from the primary, linear structure (the sequence of amino acids), a protein 
evolves during the folding process until it reaches a final state (native state) whose geo- 
metrical shape is crucial to the function of the protein itself. However, the problem of the 
geometrical arrangement of proteins in their native states cannot be regarded as purely 
static issue. Indeed, a massive accumulation of experimental data collected from X-ray, 
NMR and neutron spectroscopy, has revealed that protein native states are rather dynamic 
structures where amino acids constantly move around their equilibrium positions. This mo- 
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is usually examined and investigated 
or essential dynamics jlq . However, the study 



tion, crucially involved in protein functions 
through normal modes analysis (NMA) 
of collective motions of larg e scale proteins is generally difficult due to limited access to 
realistic all- atoms NMA 19j and simplified or approximate approaches are usually welcome. 
Tirion jstj first proposed the possibility of replacing, in protein normal mode computations, 
complicated empirical potentials by Hookian pairwise interactions depending on a single 
parameter. This approach stems from the observation that low-frequency dynamics, which 
are mainly associated with protein-domain motion, are generally insensitive to the finer 
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details of atomic interactions. Much of the subsequent literature |2lL l2z 
has confirmed the success of simple harmonic models in the study of slow vibrational dy- 
namics of large biological macromolecules, and they have become a viable alternative to 
heavy and time-consuming all-atoms NMA. This success result from the striking agreement 
of predictions with experiments, the presence of few adjustable parameters and the fast and 
easy numerical implementation on computers and fast result production. For these reasons 
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harmonic models are also utilized for the systematic analysis of large data sets of proteins. 

The topological stability of macromolecules is far from being a pure mechanical problem 
as it closely involves thermodynamics. Indeed, the relevant thermodynamic potential that 
must be minimized in order to find the stable configuration is not energy, but free energy. 
This is due to the interaction with the environment (schematized as a thermal bath) which 
is generally not negligible, especially for biological macromolecules that have a stable phase 
in a solvent. In particular, water is a very efficient medium for the transfer of thermal energy 
at microscopic scales (i.e. oscillations and molecular rotations). 

With these considerations in mind, in this work we apply NMA to investigate the influence 
of the global native state topology on the thermal stability of proteins. 

Vibrational thermal instability is a well-known topic of study in solid state physics. Since 
the initial classical analysis of Peierls j^, it has been recognized that equilibrium with 
a thermal bath can dramatically influence the possible topological arrangement of large 
geometrical structures. Up to now, the most striking consequence of Peierls' instability has 
concerned low-dimensional crystals: for one and two-dimensional lattices the mean square 
displacement of a single atom at finite temperature diverges in the thermodynamic limit, i.e. 
with an increasing number of atoms. When the displacement exceeds the order of magnitude 
of the lattice spacing, the topological arrangement of the lattice is unstable and the crystal 
becomes a liquid. For real structures, formed by a finite number of units and far from the 
thermodynamic limit, the divergence sets a maximal stability size, which is negligible for 
one-dimensional lattices and typically mesoscopic for two-dimensional lattices. 

However, thermal instability is present not only in crystals but also in structurally in- 
homogeneous systems, such as glasses, fractals, polymers and non crystalline structures. 
Here, the problem is much more complex. Generalizing the Peierls approach to mesoscopic 
disordered structures, we are able to apply this kind of argument to the thermal stability of 
macromolecules. In this article we describe how this can be done in the case of proteins; we 
predict the existence of a critical stability size depending on a global topological parameter 
(the spectral dimension) and compare our predictions with experimental data. 
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THEORY 



In a recent paper |28( generalizing the Peierls' result, we showed that a thermodynamic 
instability also appears in inhomogeneous structures and is determined by the spectral di- 
mension d. The parameter d is defined according to the asymptotic behaviour of the 
density of harmonic oscillations at low frequencies. More precisely, using g(uj) to denote the 
density of modes with frequency lu, then 

g(u) ~ J- 1 (1) 

for uj — > 0. The spectral dimension is the most natural extension of the usual Euclidean 
dimension d to disordered structures as far as dynamical processes are concerned. It coin- 
cides with d in the case of lattices, but in general, it can assume non-integer values between 
1 and 3. The spectral dimension represents a useful measure of the effective connected- 
ness of geometrical structures at large scales, because large values of d correspond to high 
topological connectedness. Moreover, it characterizes not only harmonic oscillations, but it 
also relates to diffusion, phase transitions and electrical conductivity allowing a variety of 
both experimental and numerical methods for its determination |3dl. l31|. The relevance of 
d in connection with the anomalous density of vibrational modes in proteins has also been 
considered in refs. 

In the case of thermal instability, we demonstrated that, for d > 2, the mean square 
displacement (r 2 ) of a structural unit (being an atom, a molecule or a supra-molecular 
structure according to the studied case) of a system composed of N elements, diverges in 
the limit iV — > oo. Using T to denote the temperature of the heat bath, with the 
Boltzmann constant, and with 7 the interaction energy scale, the divergence is given by the 
asymptotic law: 

7 

when d < 2. When d = 2, the mean square displacement diverges logarithmically, (r 2 ) ~ 
hsT /^/\n(N), as in the case of the Peierls' result for a two dimensional crystal. Notice 
that the divergence in (r 2 ) is only determined by d. Now, at any given temperature T, 
there will exist a threshold value N(T) beyond which (r 2 ) 1 / 2 exceeds the typical spacing 
between the nearest neighbors, making the solid structure unstable. Therefore at large 
enough values of N, the solid will experience a structural reorganization which can lead 
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either to a homogeneous liquid phase at sufficiently high temperatures or to a disordered 
3-dimensional solid, which is homogeneous on a large scale and inhomogeneous at a small 
scale. In general, the threshold values of N are very small with respect to the typical order 
of magnitude of macroscopic systems, being rather comparable to the size of large complex 
macromolecules such as biopolymers. 

This poses an intriguing question concerning proteins. Indeed, to exploit their biological 
function proteins must keep a specified geometrical and topological arrangement and cannot 
afford any, even partial, large scale geometrical fluctuations such as it happen, to swollen 
polymeric chains in a good solvent |34| . This makes thermodynamical stability crucial and 
suggests a possible correlation between the spectral dimension and the length of protein 
chains. 

Vibrational stability in proteins has been analysed with the Gaussian network model 
(GNM), proposed by Bahar et al. I35j and widely applied because it yields results in agree- 
ment with principal X-ray spectroscopy experiments. This approach generally considers 
proteins as elastic networks, whose nodes are the positions of the alpha-carbons (C a ) in the 
native structure and the interactions between nodes are assimilated to harmonic springs. 
The only information required to implement the method is the knowledge of the native 
structure, and two parameters are introduced, the spring constant and the interaction cut- 
off, which, however turn out to be related whenever the model is applyed to fit experimental 
data. The GNM can be defined by the quadratic Hamiltonian 

N 2 

i ij 

where the first term is the kinetic energy of the system, 7 being the strength of the springs 
that are assumed homogeneous, Rj and <5rj indicating the equilibrium position and the 
displacement with respect to R« of the z-th C a atoms. The model is eventually defined by 
the contact matrix A with entries: Ay = 1 if the distance |Rj — Rj| between two C a 's, in 
the native conformation, is below the cutoff Rq, while is otherwise. 

The harmonic spectrum for each structure is given by the set of eigenvalues ...,uj^} 
of the Kirchhoff matrix (or valency-adjacency matrix) = —Ay + <5y Yli^i ^iz, 

Notice that the first eigenvalue u\ vanishes and corresponds to the constant eigenvector 
related to the trivial uniform translation. 

The comparison between experimental data and GNM results is obtained via the X-ray 



crystallographic 5-factors, measuring the mean square fluctuation of C a atoms around their 
native positions 

with (•) indicating the thermal average. In the GNM approximation, this average is easily 
carried out, because amounts to a Gaussian integration, and B-factors can be expressed in 
terms of the diagonal part of the inverse of the matrix T [35! ]: 

The knowledge of the eigenvectors and eigenmodes of matrix T allows to compute the 
GNM B-factors also through formula 

8n 2 k B T y \u t (k)\ 2 

7 k ^ 

where % is the residue index, the sum runs over all non-zero frequencies u>i and Ui(k) indicates 
the i-th component of the k-th eigenmode. 

The comparison with crystallographic data is crucial for setting the correct values of the 
parameters Ro end 7. (see. Methods and Results). 



METHODS 



We present a GNM harmonic analysis performed over the dataset of protein native struc- 
tures with different sizes downloaded from the Brookheaven Protein Data Bank. The pur- 
pose of the analysis is basically to investigate whether there exists a correlation between the 
spectral dimension of native structures and the length of natural occurring proteins and, 
if so, to verify whether the correlation can be explained in terms of the above mentioned 
stability criterion determined by equation (J2J). 

Our representative statistical sample, listed in Tables HI and ITT] was selected according to 
the following criteria. First, we only considered proteins with a stable large scale geometry. 
This excludes multiple domains proteins, where domains can undergo relative motion giving 
rise to larger geometrical fluctuations. Moreover, we considered only proteins not binded to 
fragments of DNA, RNA or other substrates because such structures cannot be described 
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with sufficient accuracy in terms of simple harmonic model with only two effective parame- 
ters. Finally, we choose proteins covering uniformly a wide length interval ranging from 100 
to 3600 to test our prediction. 

The diagonalization of the Kirchhoff matrix T to obtain its eigenvalues {uf, ...,u%} and 
eigenvectors has been performed with the standard numerical packages [36]. 

The value of the interaction cutoff for generating the contact matrix A has been set to 
Ro = 7 A as customarily in such kind of studies. The cutoff choice, which affects the overall 
GNM performance, is generally tested through the correlation coefficient p 37 1 

= £i(i?i -(£))(*<-(*» (4) 



between experimental (Xi) and theoretical (Bi) B-factors. The sum runs of over the number 
of protein residues, and and (X), (B) indicate the average values. Our data set contains 
only those protein structures with a coefficient p greater than 0.5 (see last column of Tables|l] 
and|n]) this should, in principle, ensure that GNM correctly reproduces C a fluctuations 
for each selected protein. However since we shall study two different cutoffs, we decided to 
include even those few structures, such as 9RNT, 1A47, and 1CDG, that have a p > 0.5 
for one cutoff and p < 0.5 for the other one. The few instances of the agreement between 
B-factors from GNM and crystallography are shown in figure ^ where we display the best 
and the worst cases with respect to the coefficient p. 

For each protein, the optimal value of the spring constant 7 was obtained through a 
least-square fitting to the experimental B-factors expressed by formula 

k B T _ 1 TnBjXj ( 
7 8** J2i Bf [ J 

The values of ksT/^, besides being an essential ingredient for the real application of GNM 
method, are also an indication of the protein global flexibility and allows for a direct com- 
parison among all the considered structure. 

The spectral dimension d was estimated via a power-law fitting of the low frequency 
behaviour of the cumulated density of modes G(u), namely the integral of g(ui). Indeed, due 
to relation G(u) ~ to at small arguments (see Fig. 12}. The harmonic spectrum, obtained 
within the GNM, for three proteins with sizes, small, medium and large, respectively is 
plotted in figure |2l where low-frequency regions clearly exhibit the power law behavior 
whose exponent is the spectral dimension d. 
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FIG. 1: Comparison between experimental B-factors and mean square fluctuations of C a by GNM, 
for the structures 1A47 (lowest correlation) and 3PTE (highest correlation) at cutoff Rq = 7 A, and 
structures 9RNT (lowest correlation) and 3PTE (highest correlation) at cutoff Rq = 6A. Heavy 
solid line refers to crystallographic data, while t§in and dashed lines refers to GNM approximation. 
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FIG. 2: Log-log plot of GNM-harmonic spectrum referred to three proteins with different sizes, 
1IQQ (N=200), 1KCW (N=1017) and 1K83 (N=3494). On vertical axis, we report the cumulated 
distribution G(u) of vibrational modes. Low frequencies regions clearly exhibit a power-law be- 
haviour, and dashed lines indicates the best-fits of the power-law whose exponent is the spectral 
dimension. 

RESULTS AND DISCUSSION 

Our statistical analysis for the whole dataset of proteins and cutoffs i?o = 7 A is sum- 
marized in Table HJ where we report the spectral dimension and its corresponding error, 
the estimate for and finally the correlation coefficient. To test the robustness of our 

results, we have repeated the same analysis at a slightly different cutoff, namely at Rq = 6A, 
which yields a smaller but still good correlation between experimental and theoretical B- 
factors fTab. lTT|) . Errors on d- values, in both tables, were estimated to cover the uncertainty 
due to the choice of the fitting region for the power-law, because the slope of the linear-fitting 
(see Fig. |2J) can change even sensibly upon varying this region. Furthermore, error bars take 
into account also correlation data (p) which indicates how GNM can faithfully reproduce 
the low-energy deformations of a given protein structure. 

The relationship (j2J) establishes a rather strong constraint between the spectral dimension 
and the maximum size N max of a protein can afford. Since, the stability is supposed to fail 
when the fluctuation (r 2 ) 1 / 2 becomes of the same order of magnitude of the mean distance 



9 



between non consecutive amino acids (about 7 A) , one can assume that 

2 - b 

2~ 1 + HN max )- (6) 

The proportionality constant b depends on the mean amino acid spacing, on the spring 
elastic constant 7 and temperature T. However, this dependence is expected to be very 
weak (i.e. only logarithmic) and this allows for a comparison of different proteins without 
the computation of the specific parameters. It should be stressed that equation (0), being 
based uniquely on thermodynamics stability, can be actually regarded as an upper bound 
prediction only. 

Figure B verifies the prediction drawn form the thermodynamical stability argument and 
shows the final result of our analysis. We plot the quantity 2/d versus 1/ ln(iV) as suggested 
by relation (JHJ): indeed, if Eq. © holds, we should obtain a straight line crossing the y-axis 
at 1 for zero abscissa. As matter of fact, our data are well fitted by a straight line, but, with 
an offset with respect to the equation (0) 

For case of a cutoff Rq = 7A, best-fit values of the parameters are a = 0.63, b = 2.61, with 
a correlation coefficient 0.73. For the cutoff -Ro = 6A, we obtained the values a = 0.63, 
b = 3.40. with a correlation 0.72. Interestingly, the linear behaviour predicted by Eq. © 
is confirmed for two different cutoffs with a correlation larger than 0.7, providing a strong 
evidence of the robustness of the result. 



CONCLUSIONS 



We applied the Gaussian Network Model (GNM) to investigate the influence of native 
state topology on thermodynamical stability for a set of folded proteins with a very different 
sizes, ranging from 100 to 3600. Employing the GNM is appropriate in this type of study 
because such a model correctly accounts for the topological features of the native protein 
conformations. Our results show that the spectral dimension d, which is sensitive to the 
large scale topology of a geometrical structure, is one parameter governing the low-energy 
fluctuations of a given protein structure. As a consequence, one can derive an instability 
criterion for proteins, based only on topological considerations, which is the analogous of 
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FIG. 3: Linear plot showing the dependence of the spectral dimension on protein size. The dashed 
line, indicating the behaviour [71 is a best fit with a correlation coefficient 0.73 and 0.72 for cutoffs 
7 A and 6A respectively. 

Peierls' criterion developed for ordered crystalline structures. The criterion easily predicts a 
non-trivial logarithmic dependence of the spectral dimension on the length of a protein. This 



further confirms the lack of universalit y fo r the spectral dimension of proteins 



already addressed in previous studies 



an issues 



24 1 . We verified that such a logarithmic dependence 



is really observed, within statistical and systematic errors, for the whole set of selected 
proteins. Furthermore, the dependence is robust because it applies even with alteration 
of the interaction cutoff which is the most critical parameter to the GNM applicability. 
We can conclude that the relation between spectral dimension and length of proteins is 
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not an artifact due to a particular cutoff choice, providing that a significant correlation is 
maintained between experimental and theoretical B-factors. We verified that, at a larger 
cutoff, the scaling behaviour Q is preserved, although the spectral dimension grows due to 
the increase in the average connectivity of the elastic network. 
The result expressed by Eq. ((7J) deserves some comments. 

Equation (JJJ) is in agreement with the upper bound represented by Eq. ©, supporting 
the relevance of topological thermal instability as a constraint to protein geometry. More 
importantly, not only is the upper bound satisfied, but the experimental points lie on a 
straight line parallel to the upper bound line of Eq. (0). This suggests a more fundamental 
role of topological stability: the protein tends to arrange topologically in such a way to 
reach the minimum value compatible with stability constraints. In other words, for any 
fixed length, it tends to the most swollen state which remains stable with respect to thermal 
fluctuations. 

An interesting point is the meaning of the offset a — 1 which would be according 
to Eq. (JHJ). Its positive value could have different explanations, but its universal nature 
(it is a "protein-independent" because is a global shift) must be due to a very general 
mechanism. A rather obvious reason is the contribution of anharmonic interactions at finite 
temperatures; a more intriguing one could be an effective longer range interaction due to the 
presence of bound water molecules around the external amino acids, which could change the 
effective form of the interaction matrix A. This hypothesis is also suggested by the physical 
interpretation of b as an anomalous dimension exponent, typically related to a renormalized 
interactions However, the most intriguing evidence relies on the regression coefficient 
Independently of the physical origin of b, its high value strongly supports the existence of a 
thermodynamic stability threshold, dependent on the topology of the folded state, for the 
size of proteins. 
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1.90 


0.04 


0.524 


0.546 


1A54 


321 


1.86 


0.03 


0.601 


0.516 


1A0I 


332 


1.71 


0.03 


1.109 


0.826 


3PTE 


347 


1.79 


0.01 


0.366 


0.840 


1A26 


351 


1.82 


0.01 


1.369 


0.635 


1BVW 


360 


1.87 


0.02 


0.652 


0.639 


8JDW 


360 


1.94 


0.01 


1.293 


0.607 


70DC 


387 


1.92 


0.01 


0.859 


0.620 


lOYC 


399 


1.93 


0.01 


1.056 


0.697 


1A39 


401 


1.97 


0.01 


1.113 


0.656 


16PK 


415 


1.82 


0.03 


0.630 


0.590 


1DY4 


441 


1.88 


0.02 


0.785 


0.614 


1BU8 


446 


1.95 


0.01 


0.859 


0.632 


1AC5 


483 


1.87 


0.01 


1.091 


0.709 


1LAM 


484 


1.97 


0.01 


0.488 


0.583 


1CPU 


495 


1.92 


0.02 


0.620 


0.729 


3COX 


500 


1.92 


0.02 


0.491 


0.670 


1A65 


504 


2.09 


0.01 


1.042 


0.606 


1SOM 


528 


2.00 


0.02 


1.585 


0.653 


1E3Q 


532 


1.97 


0.01 


1.577 


0.623 


1CRL 


534 


2.00 


0.01 


0.969 


0.652 


1AKN 


547 


1.87 


0.01 


1.737 


0.667 


1CF3 


581 


2.01 


0.03 


1.154 


0.639 


1EX1 


602 


2.01 


0.03 


1.193 


0.598 


1A14 


612 


2.10 


0.09 


0.865 


0.524 


1MZ5 


622 


2.02 


0.04 


0.750 


0.705 


1CB8 


674 


1.92 


0.02 


1.164 


0.630 


1HMU 


674 


1.92 


0.02 


0.907 


0.684 


1A47 


683 


2.02 


0.04 


0.646 


0.529 


1CDG 


686 


1.98 


0.02 


1.074 


0.593 


1DMT 


696 


1.96 


0.02 


1.204 


0.536 


1A4G 


780 


1.98 


0.03 


0.591 


0.567 


1HTY 


1014 


2.07 


0.05 


0.646 


0.766 


1KCW 


1017 


2.05 


0.03 


2.130 


0.638 


APP1 


1021 


1.93 


0.02 


0.805 


0.576 


1KEK 


2462 


2.07 


0.05 


1.263 


0.730 


1B0P 


2462 


2.08 


0.09 


0.319 


0.810 


1K83 


3494 


2.01 


0.01 


2.030 


0.659 


1I3Q 


3542 


1.97 


0.01 


2.435 


0.758 


1150 


3558 


1.98 


0.02 


2.236 


0.701 



TABLE I: List of processed native protein structures from Brookheaven PDB, with their length, 
the corresponding spectral dimension estimateglgby GNM approach with cutoff Rq = 7A, error on 
its determination, parameter ksT/j and correlation p @. 



PDB code 


Length 


d 


Error 


KbT/j 


Correl. 


9RNT 


104 


1.43 


0.08 


0.209 


0.549 


1BVC 


153 


1.36 


0.02 


0.186 


0.626 


1G12 


167 


1.61 


0.04 


0.412 


0.599 


1AMM 


174 


1.55 


0.09 


0.113 


0.802 


4GCR 


185 


1.48 


0.09 


0.002 


0.689 


1KNB 


186 


1.70 


0.01 


1.104 


0.699 


1CUS 


197 


1.71 


0.03 


0.453 


0.693 


1IQQ 


200 


1.57 


0.02 


0.199 


0.625 


2AYH 


214 


1.68 


0.01 


0.222 


0.756 


1AE5 


223 


1.66 


0.02 


0.396 


0.537 


1LST 


239 


1.59 


0.01 


0.441 


0.707 


1A06 


279 


1.52 


0.01 


0.907 


0.621 


1NAR 


289 


1.54 


0.01 


0.257 


0.731 


1A48 


298 


1.46 


0.01 


0.235 


0.546 


1A3H 


300 


1.71 


0.01 


0.342 


0.414 


1SBP 


309 


1.62 


0.03 


0.301 


0.718 


1A5Z 


312 


1.57 


0.02 


0.914 


0.539 


1A1S 


313 


1.70 


0.01 


0.523 


0.643 


IADS 


315 


1.56 


0.02 


0.204 


0.611 


1A40 


321 


1.57 


0.01 


0.199 


0.604 


1A54 


321 


1.57 


0.03 


0.232 


0.543 


1A0I 


332 


1.60 


0.01 


0.492 


0.799 


3PTE 


347 


1.66 


0.01 


0.180 


0.840 


1A26 


351 


1.60 


0.03 


0.602 


0.613 


1BVW 


360 


1.73 


0.02 


0.297 


0.527 


8JDW 


360 


1.71 


0.01 


0.550 


0.537 


70DC 


387 


1.54 


0.01 


0.301 


0.586 


lOYC 


399 


1.74 


0.01 


0.472 


0.659 


1A39 


401 


1.78 


0.02 


0.473 


0.643 


16PK 


415 


1.67 


0.04 


0.277 


0.591 


1DY4 


441 


1.84 


0.02 


0.357 


0.535 


1BU8 


446 


1.76 


0.01 


0.331 


0.538 


1AC5 


483 


1.60 


0.02 


0.482 


0.646 


1LAM 


484 


1.75 


0.01 


0.204 


0.623 


1CPU 


495 


1.67 


0.01 


0.235 


0.546 


3COX 


500 


1.72 


0.01 


0.202 


0.571 


1A65 


504 


1.86 


0.03 


0.421 


0.701 


1SOM 


528 


1.63 


0.02 


0.560 


0.610 


1E3Q 


532 


1.67 


0.02 


0.570 


0.533 


1CRL 


534 


1.81 


0.01 


0.448 


0.648 


1AKN 


547 


1.71 


0.01 


0.800 


0.641 


1CF3 


581 


1.73 


0.05 


0.473 


0.560 


1EX1 


602 


1.73 


0.01 


0.401 


0.544 


1A14 


612 


1.86 


0.02 


0.373 


0.538 


1MZ5 


622 


1.68 


0.02 


0.274 


0.740 


1CB8 


674 


1.66 


0.02 


0.425 


0.627 


1HMU 


674 


1.67 


0.01 


0.334 


0.652 


1A47 


683 


1.75 


0.01 


0.236 


0.376 


1CDG 


686 


1.76 


0.01 


0.402 


0.454 


1DMT 


696 


1.73 


0.09 


0.484 


0.549 


1A4G 


780 


1.99 


0.09 


0.244 


0.554 


1HTY 


1014 


1.70 


0.07 


0.267 


0.739 


1KCW 


1017 


1.82 


0.01 


0.918 


0.581 


APP1 


1021 


1.83 


0.05 


0.299 


0.572 


1KEK 


2462 


1.90 


0.04 


0.439 


0.664 


1B0P 


2462 


1.94 


0.02 


0.118 


0.695 


1K83 


3494 


1.90 


0.04 


0.781 


0.631 


1I3Q 


3542 


1.94 


0.02 


0.883 


0.691 


1150 


3558 


1.96 


0.05 


0.816 


0.653 



TABLE II: List of processed native protein structures from Brookheaven PDB, with their length, 
the corresponding spectral dimension estimate(|lyby GNM approach with cutoff Ro = 6A, error on 
its determination, parameter ksT/j and correlation p @. 



